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A  ilAUDIO  STSTBC  MUCH  UTILIZES  STATISTICAL  DECISION  FUNCTIONS 

[Following  is  s  translation  of  an  article  by 
S.  F*  Xosubovs'kyy*  in  the  UkminUn-lnngnngs 
periodical  Artematrka  (Autonatiea),  Vol  ?,  Mo  6, 

Instituto  of  Kleotrioal  Engineering ,  Academy  of 
Scisncss,  Ukrainian  SS&,  Il«r,  1962,  page*  60-64, 

This  artiole  was  submitted  to  th«  editora  on 
28  April  1962.3 

The  vigorous  development  of  cybernetics  in  the  last 
decade  has  been  accompanied  by  intensive  use  of  the  most 
diverse  mathematical  tools  for  working  out  and  investi¬ 
gating  automatic  control  systems.  The  theory  of  statisti¬ 
cal  decisions  as  one  of  the  subdivisions  of  probability 
theory  has  been  utilized  successfully  in  the  solution 
cybernetic  problems  along  with  information  theory,  mathe¬ 
matical  logic,  and  the  theory  of  games. 

One  of  the  papers  presented  at  the  winter  session 
of  the  American  Institute  of  Electrical  Engineers  (23  Janu¬ 
ary  to  2  February  1962  in  New  York)  was  devoted  to  the 
application  of  the  theory  of  atat let leal  decisions  in  the 
analysis  and  synthesis  of  cognitive  systems  which  can  learn. 
The  paper  was  read  by  a  worker  of  the  Laboratory  of  Control 
Systems  and  information,  Purdue  University  (Lafayette),  K. 
(S.  Fu»  It  had  the  title  "A  Learning  System  Using  ‘ 
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Jstatistical  Dacia  ion  Functions".  [Saa  Note).  | 

([Nota]  K.  S *  Fu,  "A  Laarnlng  System  Using  Statis¬ 
tical  Decision  Functions,"  presented  at  the  AXES  Winter 
General  Meeting  in  New  York  January  28  -  February  2,  1962*} 

The  paper  contained  a  discussion  of  questions  con¬ 
nected  with  the  analysis  and  synthesis  of  a  learning  system 
which  processes  input  data  in  the  form  of  input  signals 
(input  measurements )  and  so-called  "learning  observations". 
The  system  worked  as  a  cognitive  device*  The  problem  of 
recognition  was  formulated  in  terms  of  the  theory  of  sta¬ 
tistical  decisions.  It  was  shown  in  the  paper  that  tho 
tern  can  improve  its  work  (learn)  by  increasing  the  number 
of  learning  observations. 

A  summary  of  this  paper  is  given  below. 

Introduct ion 

The  word  learning  is  understood  to  be  some  systematic: 
behavior  of  a  system  which  leads  to  certain  forms  of  stabil¬ 
ity  [1).  A  system  which  systematically  varies  its  behavior 
in  accordance  with  a  set  of  input  signals  can  be  regarded  at 
a  system  which  learns.  In  case  the  system  is  well  designed 
its  behavior  Improves  In  accordance  with  one  or  more  cri¬ 
teria.  These  criteria  are  usually  linked  with  decreases  in 

errors  or  with  increases  in  the  stability  of  the  system.  , 

l  _ j 


Tk  model  of  learning  from  encouragement  wae  atudled  in  j 
different  forma.  Syatema  which  learn  or  realize  such  a 
model  are  usually  trained  by  an  external  "trainer". 

This  trainer  selects  and  applies  the  encouragement  factor 
on  the  basis  of  Investigations  of  the  reactions  of  the 
system.  However*  more  autonomous  learning  systems  are 
necessary  in  which  variation  of  their  behavior  will  be 
achieved  by  the  system  itself,  without  any  external  in* 

I  • 

terventlon. 

In  studying  adaptive  control  sysems  we  start  with 
the  idea  that  they  are  capable  of  varying  their  behavior 
to  correspond  with  changes  in  the  characteristics  of  the 
controlled  process  which  cannot  be  foreseen  or  guessed 
[2].  In  this  connection  an  adaptive  system  Is  capabiW  tf 
varying  its  behavior,  for  example,  by  changing  its  imp  ,, 
transfer  function  in  such  a  way  as  automatically  to  recog* 
nlze  a  signal  which  appears  at  random  and  is  subject  to 
noise  [3l. 

When  machines  for  pattern  recognition  are  to  recog¬ 
nize  new  patterns,  it  Is  usually  desirable  to  matce  use  of 
past  experience.  One  of  the  possible  approaches  to  solving 
such  problems  Is  to  design  a  system  which  learns  from  exper¬ 
ience  to  vary  its  behavior  In  accordance  with  changes  in  the 
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Tlnput  signal.  I 

The  learning  process  was  illustrated  in  the  oaper 
by  means  of  a  system  which  operated  as  a  cognitive  or 
classifying  device.  It  is  clear  that  a  machine  which  is 
capable  of  establishing  similarity  in  input  signals 
solve  many  problems  that  must  now  be  solved  by  hv?>.n 
Inge.  Automatic  elaaaif ication  la  determination  >  ;,h 

class  of  signals  at  the  input  of  the  device  on  the 
of  a  set  of  measurements  of  the  input  signal  and  a  scquen 
of  learning  obaervat iona .  Learning  observations  srs  under 
stood  to  mean  measurements  of  a  finite  number  of  possible 
input  signal*  of  each  class.  After  an  input  signal  is  re¬ 
cognized,  a  certain  action  la  performed,  for  example,  a 
change  in  the  system  parameters  in  order  to  increase  its 
stability.  The  losses  which  the  machine  incurs  as  a  result 
of  each  possible  decision  are  evaluated  with  the  aid  of  an 
inaeparable  weighting  function.  The  purpose  of  the  person 
who  receives  the  decision  is  to  minimize  the  probability  r>~ 
looses  in  the  course  of  recognition. 

The  analysis  and  synthesis  of  a  learning  system 
alio  regarded  as  prooeadlng  on  the  basis  of  data  obtained 
by  measuring  tha  input  quantities  and  the  learning  observa¬ 
tions.  The  losses  which  are  unavoidable  in  obtaining  a  auf- 
ixiclant  number  of  accurate  measurements,  the  abstract  J 


I  nature  of  the  properties  which  define  the  class  of  the  in*| 
put  signal  to  the  system,  and  the  statistical  features 
which  are  characteristic  of  measurements  make  It  essen¬ 
tial  to  utilise  the  so-called  statistical  methods  "or  ana¬ 
lyzing  and  synthesizing  systems* 

Although  a  system  is  designed  for  recognition 
certain  class  of  input  signals  in  order  to  perform  a  c<r.  ~ 
tain  action  such  as  varying  the  behavior  of  the  system 
after  acceptance  of  the  decision,  in  general,  however, 
the  measurements  which  are  to  serve  as  the  basia  for  re¬ 
cognition  do  not  provide  enough  information  for  the  class 
of  Input  signal  to  be  recognized  successfully  at  all 
times.  The  statistical  methods  which  are  applied  in  such 
situations  constitute  the  subject  of  the  theory  of  statis¬ 
tical  decision  functions  [4  -  7], 

The  Problem  of  Accepting  Statistical  Decisions 

The  problem  of  accepting  a  statistical  decision  is 
formulated  In  a  fairly  general  form  which  Includes: 

1)  the  state  of  a  priori  knowledge?  2)  the  losses  con¬ 
nected  with  a  possibly  incorrect  decision?  3)  losses  due 
to  experimentation*  The  problem  is  regarded  as  connected 
with  a  sequence  of  values  of  some  random  variable,  let  us 
■say,  x, ,  x2,  ...,  xn  denoted  as  fx}  which  can  be  I 
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Icons  ldered  the  result  of  a  series  of  Measurements  of  th.~1 
Input  signal  which  is  to  be  recognized. 

Let  ili  be  a  parameter  space.  It  Is  subdivided  into 
zones  0)^,  (*>2,  .  ..,ton  where  6^  corresponds  to  the 
class  of  the  correct  signal  at  the  1th  Input.  Here  m 
Is  the  number  of  possible  classes  of  Input  signals  which 
must  be  recognized.  Bach  point  In  the  parameter  apace 
corresponds  to  a  set  of  values  of  the  parameters.  {0}  .  Ve 
shall  uae  to  .denote  the  hypothesis  according  to  which 

some  (0}  lies  in  Bach  set  of  possible  values  of  {x] 

corresponds  to  a  point  in  an  n-dlmensinnal  sample  space  A* 
The  A  space  in  turn  Is  subdivided  into  zones  £2,  . . . » 

...,  here  the  hypothesis  must  be  accepted  only  in 

case  [x]  is  contained  In  The  problem  of  accepting  a 

statistical  decision  must  now  be  defined  as  the  problem  of 
selecting  zones  6g,  In  the  sample  apace  for 

given  zones  u>m  in  the  parameter  space  £2,  . 

Let  P  »  (P^,  P2>  ...»  Pm)  be  an  a  priori  die  tv!1.  • 

bution  defined  on  the  space.  P^  is  the  a  priori  pro¬ 
bability  that  the  input  signal  will  belong  to  class 
The  decision  space  which  is  accepted  by  the  system  for  re¬ 
cognition  Is  composed  of  m  possible  decisions 
d1,  d2,  ...»  dB  where  dj^  Is  the  decision  for  which 
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[hypothesis  Hj  is  accepted.  The  weighting  function  1 
dj)  is  that  loss  Incurred  by  the  system  in  case  *, 
decision  dj  is  accepted  while  the  Input  signal  actually 
belongs  to  the  class  In  case  there  is  a  nonsequen¬ 

tial  process  of  accepting  the  decision  (in  case  some 
correct  d  is  accepted  as  the  decision)  the  conditional 
risk  in  recognizing  Input  signals  that  belong  to  cl:*^s  -  ’ 
can  be  written  in  the  form 


r  («i.  *)  *  J  L  («,,  4)p  (x ><)  dx. 

* 

(i ) 

where  Integration  is  over  the  entire  A  space* 

p(x, 

is  the  conditional  probability  of  x 

on  condi 

tion  that 

tdj.  Is  the  corresponding  true  class  of  the 

measured 

input  signal.  For  a  given  P  the  mean 

risk  is 

equal  to 

/?(P.  *)- 

(2) 

-jp(x)rx(P.  4)4xt 

(3) 

where 

2  d)  PiP 

-  .  f-1 

,w 

% 

(*) 

7 

^is  defined  at  the  a  priori  conditional  risk  of  the  correct! 
decision  d  for  the  given  measurement  x. 

After  that  it  i*  necessary  to  choose  the  correct  do 
clslon  so  as  to  minimize  the  mean  risk  R(P,  d)  or  to 
minimize  the  maximum  of  the  conditional  risk  r(u>j,  d). 

The  decision  rule  which  minimizes  the  mean  risk  la  called 
the  Bayes*  rule.  According  to  (3),  it  is  sufficient  to 
consider  each  x  separately  and  tc  minimize  rx(P,  d). 

In  case  d*  is  Bayes*  rule,  then 

rx(P.  *’)*Zrx{P  * (5) 


For  a  constant  function  the  losses 


L  (•  1, 


0,  /ts/ 

V,  l+j 


and,  without  losing  generality,  when  v  *  1,  it  la  *;*By 
to  show  that  the  risk  function  d  Is  essentially  the  pro¬ 
bability  of  incorrect  recognition.  In  order  to  minimise 
{the  probability  of  incorrect  recognition  it  is  necessary  i 


Fto  choose  m  the  condition  that 
<* /•  j)  >  PjP  (*>y>  f*r  afj  i  +  j. 

In  case  we  define 


hf 


P  W*j> 


H 

.* 

(«i 


(?) 


a*  the  similarity  relation*  then  Eay**1  rule  for  the  de¬ 
dal  on  state  dj,  ito&t  is,  la  accepted  in  case  that 


it/> 


£t 

h 


Ur'  t>n  .  i  +  j. 
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The  Learning  Process  In  the  Recognition  Pro'bli  .• 

In  the  pi  seedling  section  we  examined  the  pr-obi.  - t  -\ 
recognition,  without  learning;,  with  the  aid  of  the  theory 
of  statistical  decistoras*  In  that  sect lan  we  examined  t:h« 
developaent  of  a  Method  for  recognidjcig  a  dess  of  Impuh 
signals  based  on  a  set  of  Measurements  of  the  inpat 
and  a  sequence  of  learraixig  abservationa *  The  process  c<m- 
slaied  of  a  set  of  Measurements  of  the  input  signal 
>:  «  ,  x>,  .  ♦ . ,  x  l  to  be  recognized  and  a  set  of 

measurements  of  pcasibla  Input  signals  of  class 
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[for  each  1*1,  2,  a.  The  last  measurements  which 
are  called  learning  measurements  ara  rapraaanted  in  the 
form  m  *  {  xi  v  Xl  * '  •  •  •  *  Xf  •  •  •  *  *  1  .  The 


i’  . Ai*v 


rule  for  acceptance  of  dec la ion  d  la  a  function  which 
transforms  each  set  of  measurements  (x»  m)  into  an  ae- 
tion  which  follows  recognition  of  the  class  of  the  measured 
input  signal. 

Let 

. . . 

(n) 


where  «  {xl,l»  xif2*  ifi  a  8<>t;  which  is 

composed  of  training  measurements  of  class  while 

/  *«  (/,,  4>'  •  *»  ^*5  \  .i 

denotes  a  set  with  elements  /p  1 2,  4W.  The  rule 

for  accepting  the  decision  on  the  basis  of  measurements 
(x,  a)  is  denoted  by  d(x,  m). 

For  any  decision  rule  d  the  conditional  risk  in 
recognizing  input  signals  which  belong  to  class  is 

equal  to 


L  t«*|,  d)p{x,  mfmi)dx<tm. 


(13) 
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Twhere  P  Is  the  space  of  learning  maasuramenta  m; 
p(x,  m./u^)  la  the  conditional  probability  (x,  m)  In 
case  is  the  true  class  of  the  measured  Input  signal. 

In  case  P  Is  an  a  priori  distribution,  the  mean  rid ;<  is 
equal  to 


Rt(P,  <0- 


-u 


«)  r  (Jf,  *»)(/».  d)dx'.4m, 


(14) 

(15) 


where 


r(jr,  m)  *0 


(*»/.  <0  PiP  (x, 
/>(■*.  <") 


**/«<) 


is  defined  as  the  conditional  risk  of  the  decision  ro.js  d 
for  the  given  measurements  (x,  a). 

If  P  is  fixed  but  unknown,  ard  If  the  learning 
measurements  of  different  classes  is  statistically  unre¬ 
liable,  then 

PiP(x,  —  m.  *i\  md  P  m)  P  (m).  (17) 

Now,  equation  (14)  can  be  represented  in  the  form 
Ri(P.  f /» (m)  V  ^  (“f*  d)pfc'*h  mi)  P{*»Jtn>  dx  tlm.  , 


r 


If  d'  is  Bayesian,  thsn  vs  obtain 


(19) 


m  m 

EL^h  d')PWmi'  •*<)*  (**/"»)  <*^1  (•<,  <t)p{xjmu  mi)P(milm)  (20) 


for  each  set  of  measurements  (x,  m)  and  each  decision 
rule  d.  The  expressions  p(xA>1#  m^)  and  P(«j/m)  will 
be  defined  on  the  basis  of  learning  measurements. 


A  functional  diagram  of  a  learning  system*.  1  -  learning 
measurements  mj  2  -  measurements  x;  3  -  determines '  ona 
of  maximum  amplitude;  4  -  input  signal  whioh  Is  recog¬ 
nized  . 


In  order  to  show  that  a  system  can  learn  or  change 
its  characteristics  In  accordance  with  learning  observe- 
jtlons,  it  is  necessary  to  prove  that  the  Bayesian  risk  — ! 
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[represented  by  equation  (18)  is  a  nonincreasing  functiorj 
of  the  number  of  learning  observations.  In  other  words , 
adding  to  the  learning  observations  does  not  Increase  the 
risk  in  the  system. 

We  have  arrived  at  the  following  theorem:  If 

f2*  *  *2*  •••*  4*  *  for  anx  a  prl or  1 

distribution  q£  p  and.  Ite garnlsn daslslga d*  SEd 

d  *  * 

#,.</».  *'•)<*,(/>.*').  (21) 

where  d ' *  Is  the  Bayes  rule  for  learning  observations  jf* « 
The  theorem  can  be  proved  as  follows.  According  to 
the  definition 

R?(P.  **)<*!•  (A  *  >. 

*  >4 

In  accordance  with  (18) 

/?,.  (P.  rf')  - 1  j^p  (*•)  ’ £ 1  (»J.  W')z>  «j)  p  (*il**)  dxdm*.  (25  ) 

In  view  of  the  fact  that  d1  Is  not  a  function  of 
the  difference 

at*  —  * -•  (x,||+t,. . *1  ,/}*•••*  1  •  *<■,!},)»  (2*0 

-  equation  (23 )  can  be  written  in  the  form 


13  — 


]*  1 

HI. 

Xdxdm-§  j p  (m)  l  (*t-  d')p(x!»i.mi)  P(*rfm)dxdm  ~  Rt{P,  d  ). 


Consequently, 


<!’*)<#,,  (P,  d‘)~Rt(P.d‘). 


!9U  \ 


For  a  constant  function  the  losses 


LC^, 


0,  i  -  j 

1.  1  ♦  J. 


For  learning  observations  Bayes*  rule  becomes  ^ 
P{*ilm)p(x[mi,  m)>  (*/•>'  %)  hn*  tt)i  i  +  i 


A  functional  diagram  of  a  model  which  can  learn  Is  Brown 
In  the  figure* 


Conclusions 

the  design  of  a  learning  system  and  the  possibility 
of  the  system  accepting  decisions  in  accordance  with  exter¬ 
nal  changes  and  learning  observations  we**e  examined.  After 
accepting  a  decision  the  system  makes  use  of  certain  $tepa 
directed  at  improving  its  operation  as  it  accumulates  ex¬ 
perience  . 

I  The  problem  of  recognition  or  classification  !■> 


Tfamulated  in  terns  of  th*  theory  of  stut  Int  leal  decision*!* 
Bayes1  rule  for  recognition  with  learning  Is  determined  an 
a  function  of  the  Input  signal  and  learning  meaaurenerits « 
Learning  measurements  are  utilized  la  calculating;  condi¬ 
tional  probabilities  of  all  unknown  information.  Ih  case 
the  value**  of  the  unknown  parameter  -which  characterised 
each  class  may  be  obtained  from  learning  observations,,  the 
Bayes'  risk  of  a  learning  system  approximates  the  Bayes1 
risk  of  a  system  vlth  a  priori  knowledge  of  Us  general 
characteristics . 

It  -was  shewn  that  the  ajstiaa  can  improve  its  opera¬ 
tion,  or  learn  as  noire  learning  obtser rations  are  provided* 
Add  inf;  to  the  learning  observations  Sees  not  increase  the 
risk  in.  the  systsu* 
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THE  ADVANTAGE  OF  ONE-LAYER  COGNITIVE 


X2ARNING  SYSTEMS 

following  is  ths  translation  of  an  artiols  by  O.  Q*  IvakS- 
nenlco,  Kiev,  in  ths  Ukralnlanwlangnag*  periodLoal  Avto- 
matyfra  (Autonation),  Vol  7,  No  6,  Publishing  House  of  ths 
Aoaaamy  of  Soienoes,  Ukrainian  SSR,  Slav,  1962,  pagas  10- 
19*  Ths  artiols  waa  submitted  to  ths  editors  12  Juno  1962*7 

Systems  Peroelving  Letters*  Syllables.  Words,  and  Phrases 

The  previously  deeeribed  system  (Alpha)  peroelves  (reads)  individ¬ 
ual  letters,  figures,  or  any  other  symbols*  When  letters  are  promote- 
in  ohronologioal  sequenoe  on  the  input  of  the  system  it  is  net  tr  .  sax . 
to  reproduce  them  exactly  as  they  appear  in  print,  nor  is  it  new  r  v, 
to  enunciate  them  as  they  are  pronounced  Individually,  because  it  li: 
produce  distortions  of  the  normal  speech  due  to  the  f act  that  the  oar:1  it  > 
pronunciation  of  each  letter  varies,  depending  on  its  position  among  c  > 
letter*  in  aooordance  with  phonetlo  rule**  Thus,  it  appears  that  sys': 
porosiving  at  onoe  entire  words  or  even  phrases  would  be  nruoh  mere  el 
tlve* 

Amalgamation  of  Certain  Output*  of  the  System  by  Introduction  of  an  Elf 
meni  of  Logioal  "0rw  (Alternate)*  "  * 

When  a  oertain  letter  is  projected  on  the  input  of  the  “Alpha" 
system,  it  produces  a  tension  on  the  output  of  the  corresponding  group 
of  associated  cells  (already  informed).  This  tension  is  oloae  to  lta 
maximum  potentials  45,  60,  55,  50,  60,  eto*  There  are  oertain,  compara¬ 
tively  small,  aberrations  occurring  because  of  the  so-called  "noises" 
produced  by  the  differences  in  the  design  of  a  given  letter*  This  hap¬ 
pens  because  the  voltage  indicator  does  not  respond  to  these  small  vari¬ 
ations  and  therefore  the  entire  system  tends  towards  "generalisation" 
or  "extrapolation"  •  3hua,  the  various  designs  of  the  same  letter  aoquire 
the  seme  nomenclature*  From  this  it  does  not  follow,  however,  that  dif¬ 
ferent  designs  of  the  same  letter  oould  be  combined  to  form  one  oonoep- 
tion*  For  instanoe,  large  print  of  the  letter  "E"  is  entirely  different 
from  the  small  letter  "e",  and  in  order  to  Identify  eaoh  it  would  be 
neoessary  to  use  divert*  groups  of  aesooiated  oella*  Should  "Alpha" 
system  be  adapted  to  a  sound-reproduction  device,  then  both  variations 


of  this  lottoar  ("l*  and  V)  could  bo  oomblnod  by  usihg  a.logloel  suppo¬ 
sition  of  an  alternative  "or**  Id  that  ease  tho  sound  reproduction  for 
both  would  have  an  ldoatiool  sound  of  ■#*•  lho  asms  prooooo  tokos  plaoe 
also  la  tho  husan  wind* 

Optimum  Division  of  tho  Input  Information 

Tho  olio  of  tho  system  oould  bo  determined  by  tho  number  of  tho 
associated  oolls  (relays )  roqulrod  for  Its  functioning*  In  tom  tho  num¬ 
ber  of  oolls  depends  on  tho  voluns  of  information  "rood"  into  tho  eyatem 
and  also  on  tho  methods  of  its  distribution*  Thus,  larger  volume  of  in¬ 
formation  prodioatos  a  larger  also  of  systaau  Syetsma  oapable  of  par** 
ooiving  syllablos  (reading  and  sequencing  thorn  ohronologically)  should 
bo  larger  la  olao  than  system*  reading  only  letters*  Systems  perceiving 
or  reading  a  complete  word  would  be  still  larger,  and  tho  largest  of  all 
would  be  a  system  oapablo  of  reading  eoinplote  phrases* 

At  this  point  wo  wish  to  note  that  this  Tory  fact  reveals  a  po¬ 
tential  possibility  for  eventual  design  of  systems  that  would  be  sore 
accomplished  than  the  human  brain,  -  systems  that  would  perceive  infor¬ 
mation  by  complete  oentenooo  or  even  by  entire  pages  of  text,  -  a  feat 
impossible  for  the  human  brain*  Sow,  lot  us  analyse  what  constitutes 
an  optimum  diviaion  of  information  from  the  point  of  view  of  the  «ys- 
tew»s  slat*  In  praotioal  applioation  this  resulta  in  the  ehoioe  of  the 
number  of  layers  of  associating  oslle* 

Syllables  are  formed  with  letters,  words  -  with  syllables,  phrases 
-  with  words*  Thus,  it  is  possible  to  design  e  four-layer  system*  first 
layer,  of  associating  oella,  will  paroeLve  letter*,  aeoond  -  syllables, 
third  -  words,  and  fourth  -  the  entire  phrase*  In  designing  the  aywfcmr 
it  would  be  possible  to  "omit"  some  of  the  layers  and  thus  oonstrai  a 
system  with  three  layers,  too  layers,  or  even  one  layer* 

Calculation*  proved  that  at  a  given  volume  of  input  infomatlon, 
a  one-layer  system  is  the  moat  eoonomioal,  -  it  requires  a  of 

aesooiating  oells* 

Example 


Let  us  investigate  the  problem  of  optimum  number  of  layers  in 
praotioal  application. 

Assuming  it  is  required  to  oonstruot  a  learning  system  oepeble 
of  oogxdtlon,  as  well  as  reaction  to  phrases  oomposed  of  words  included 
in  the  following  sixteen-word  vocabulary* 


duerpo 

Z?ast7 

V6AMB m 

/5iow7 

BBOpX 

5$ 

sasaA 

/Eaokwarde/ 

BHH3 

Z3own7 

OVtXk 

BJteso 

£ef$7 

no 

snipe# 

/forward/ 

QMp*OT» 

/?V"g 

snpaBo 

^ight/ 

CHH3HTb 

^Towerij^/ 

rpy3 

cron 

/stog/ 

OTHraxb 

^Bove/ 

4 

yBMtmrtb 

increase/ 

Let  us  also  assume  that  a  system  would  be  optimum  in  size  if  it 
had  a  minimum  of  associating  cells*  In  making  our  comparison  we  shall 
make  the  following  assumptions »  a)  No  more  than  12  symbols  are  neces¬ 
sary  to  identify  an  individual  letter)  b)  the  longest  word  in  the  vocab¬ 
ulary  is  ynemironb  /uveliohit*)  increase/  and  it  has  nine  letters  and 
four  syllables)  o)  all  words  are  oompoeed  from  an  alphabet  of  24  letters 
(in  Russian))  d)  vocabulary  consists  of  the  sixteen  above  mentioned 
words)  e)  the  longest  phrase  consists  of  four  words,  and  the  number  of 
informative  phrases  is  20  (only  20  informative  phrases  are  read  into  V 
input))  f )  the  number  of  syllables  in  the  vocabulary  is  30* 

Let  us  prooeed  now  with  calculations  of  the  number  of  cells  re¬ 
quired  for  different  variations  of  the  system  in  order  to  determine  the 
optimum  number  of  layers  from  the  point  of  view  of  the  size  of  the  sys¬ 
tem*  The  number  of  cells  in  each  layer  is  determined  according  to  the 
formula* 

N  ■  nUta 

where  n  -  is  the  number  of  symbols  used  depending  on  the  ays tern* a  capac¬ 
ity  for  division)  24  -  factor  of  all  possible  separate  images)  m  -  fac¬ 
tor  of  all  utilized  separate  images  (there  is  a  number  of  images  that 
are  perceived  simultaneously)*  For  instance,  for  one  layer  of  system 
(Alpha)  reading  consecutively  33  letters  of  the  alphabet,  we  obtain  the 
following*  -  N  -  12  X  33  X  1  -  396  oella. 

Systems  That  do  not  Store  the  Parotlved  Information 

For  the  purpose  of  brevity  we  shall  limit  ourselves  to  the  conr 
structlon  of  a  system  that  reads  only  one  word*  Figure  1  is  a  diagram 
of  a  two-layer  system  capable  of  reading  first  the  letters  and  then 


words*  A  number  of  oells  la  this  system  equals 

1  layer 

2  layer  ' 


The  number  of  oells  in  eeoh  layer  Is  determined  according  to  the 
formula*  N  *  alia  where  a  -  iq  the  naaber  of  eyaibols  used  depending  on 
the  system's  oapeoity  for  division}  V  -  factor  of  all  possible  separate 
images}  m  -  factor  of  all  utUlaad  separate  images  (there  is  a  number 
of  Images  that  are  perceived  simultaneously)  *  For  Instance,  for  one 
layer  of  system  (Alpha)  reading  consecutively  33  letters  of  the  alpha¬ 
bet,  we  obtain  the  following*  -H -12*  33  *1-396  oells. 


12  X  24  X  9  -  2592 
216  X  16  X  1  -  3456 


items  That  do  not  Store  the  Peroeived  Information 


For  the  purpose  of  brevity  we  shall  limit  ourselves  to  the  con¬ 
struction  of  a  system  that  reads  only  one  word.  Figure  1  is  a  diagram 
of  a  two-layer  system  capable  of  reading  first  the  letters  and  then  vox 
4  number  of  oells  in  this  system  equals 


1  layer 

2  layer 


12  X  24  X  9  -  2592 
216  X  16  X  1  -  3456 


T 


u  v  e  1  i  oh  i 


increase/ 


1S>  2  i  £  i 

i _ 


ix, HiH 


IM  1  | 


lndloator  of  waadmua  tension 


J  uveliofait'  /pLacrhnJ 


Figure  1*  Two-layer  system  perceiving  first  letters  then  words. 
fth*  numbers  1-46  designate  the  words  similarly  numbered  on  page  19*7 
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Figure  2  ia  another  variant  of  a  two-layer  system  perceiving 
syllables  and  then  words#  The  number  of  cells  in  this  case  is* 

1  layer  60  X  30  X  4  -  7200 

2  layer  120  X  16  X  1  -  1920 

Total  9120  cells 


Figure  2,  A  two-layer  system  that  reads  first  syllables  then  ".'oris 

Designations  are  the  sane  as  in  Figure  1. 

Finally,  Figure  3  is  a  one-layer  system  that  reads  the  entire 
word  at  onoe*  This  system  uses 

108  X  16  X  1  -  1728  cells 

This  oroves  that  the  one-layer  system  is  in  fact  an  optimum  system  from 
the  point  of  view  of  size  (providing  the  system  is  not  required  to  store 
the  ii'iorr.ation)# 
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Figure  3.  One-layer  system  which  reads  the  entire  word  at  onoe. 

Designations  are  the  same  as  in  Figure  1. 


Let  us  calculate  the  number  of  sells  required  for  a  one-layer  - 
tem  perceiving  any  of  the  twenty  possible  different  coroiar.&i  that  .oul 
be  given  by  using  any  of  the  above  mentioned  sixteen  words. 

N  -  (108  I  4)  X  20  I  1  -  8640  cells 
A.  System  That  Gould  Read  Shakespeare 

Let  us  assume  that  we  wish  to  design  a  system  capable  of  perceiv'- 
ing  all  the  words  in  Shakespeare's  vocabulary  (considering  the  longest 
word  to  be  of  10  letters,  and  the  vocabulary  to  be  of  30,000  words). 

Such  case  would  require 

120  x  30,000  XI  -  3,6.106  ©ells 

In  contrast  to  a  man  this  system  is  capable  of  reading  an  entire 
phrase  at  once  which  makes  it  superior  to  the  human  brain. 


In  order  to  discriminate  between  all  the  phrases  that  could  be 
used  in  instructing  people  and  adding  to  it  all  the  possible  variations 


of  t bam  (such  u  grammatieel*  forimstaaee)  It  would  bo  neeeamary  to  pro* 
vido  this  system  with  many  mare  eajiVsi  than  ax*  available  la  the  human 
brain*  But  this  system  thaqp  ualdUmisum  would  bo  ospahlt  of  reading 
entire  phrases  at  onoo  (and  not  rood  by  qOlablo*  or  words  as  nan  does)* 
This  explains  the  foot  that  the  eoianee  of  blsnoadaaa  at  the  present 
time.  Is  trying  to  solve  the  problem  of  prodnoing  inexpensive  binary 
micro  elements  and  develop  methods  for  designing  systems  with  total  num¬ 
ber  of  associating  calls  in  the  ardar  of  101®* 

Systems  with  Capacity  for  Storing  Perceived  Information 

% 

lhat  are  the  methods  for  redoodag  the  number  of  required  cells? 
These  methods  are  suggested  to  us  by  nature  itself*  To  begin  with  the 
reading  systems  with  any  numbar  of  layers  of  associating  cells  oould  not 
read  the  entire  information  at  onoo*  they  read  progressively*  in 
and  use  the  short-lived  ability  of  memorising  the  text  as  man  does* 
is  known  from  the  general  theory  of  oooaunicatiena*  say  alleviations 
the  process  of  the  logout  of  information  tend  to  simplify  the  entire  • 
tern* 


Introducing  special  memorising  devioes  permits  a  sharp  reductic  . 
in  the  number  of  associating  oalls*  This  fact  oould  easily  be  proven 
by  examples* 

la  the  "Alpha"  system  the  33  letters  in  the  alphabet  could  bo 
read  either  by  using  33  groups  of  associating  cells*  acting  in  parallel , 
(as  we  have  demonstrated  previously)  or  by  SMttnff  KrSingle  group  which 
is  continuously  learning  and  relearning  (with  the  help  of  the  first 
additional  and  reversible  link)  letters  that  are  projected  on  the  input 
of  the  system*  Information  thus  obtained  is  transmitted  to  the  memo¬ 
rizing  device*  Identification  of  the  letters  is  carried  out  by  means 
of  their  comparative  analysis  in  the  output  group  and  also  in  the  memo¬ 
rizing  devioe  (l*e*  with  the  help  of  the_  cut-off  reversible  devloe)* 

For  instance,  if  a  sequence  of  symbols  already  projected  on  the 

input  of  the  system,  has  bean  assigned  number  "1"*  then  at  any  repetition 
of  the  same  sequence  It  would  be  stored  again  under  number  "1"*  In  a 
system  that  perceives  sixteen  words  (Tig*  1*  2*  3)  utilisation  of  memory 
devices  also  permits  use  of  only  one  system  "Alpha",  oapable  of  reading 
letters  in  chronological  sequence* 

In  Fig*  1,  2,  3,  dotted  lines  indicate  that  part  of  the  system, 
which  remains  under  maximum  exploitation  of  the  memorizing  devloe*  Bach 
layer  retains  only  one  system* 

It  oould  be  readily  proven*  that  in  oaae  of  memorizing  inforrar*- 
tlon,  the  one  layer  system  also  requires  a  minimum  of  associating  cells* 
It  is  also  evident  that  it  will  work  freer  and  more  often  than  two  or 
three  layer  systems  (beoauee  it  will  require  more  operations  of  reading 


and  memorizing)* 


In  this  respect  a  compromise  solution  considering  not  only  the 
size  of  the  system,  but  also  the  speed  of  its  operation,  is  suggested 
to  us  by  nature  itself*  It  is  impossible  for  a  man  to  read  a  oomplete 
sentence  at  onoe,  thus,  evidently  (considering  what  has  been  said  before 
about  reading  individual  letters)  an  optimum  system  -would  be  a  system* 
that  would  read  syllables  or  words  with  a  transitory  memorizing  of  the 
text*  In  this  fashion  we  would  secure  a  satisfactory  solution  for  opti¬ 
mum  volume  of  information,  to  be  introduced  into  the  system  simultan¬ 
eously*  At  the  same  time  we  would  be  able  to  determine  the  optimum 
division  of  this  information  into  parts,  having  in  mind  not  only  the 
size,  but  also  the  speed  of  the  system*  However,  leaving  out  the  ele¬ 
ment  of  speed,  and  considering  only  the  size,  the  one-layer  system  ap¬ 
pears  to  be  the  most  advantageous,  regardless  of  whether  memorizing  de¬ 
vices  are  available  or  not,  providing  that  the  entire  input  of  the  sys¬ 
tem  is  read  as  a  whole  and  not  in  parts* 

Systems  Employing  an  Element  of  Probability 

Another  method  of  reducing  the  size  of  the  system  is  based  on  the 
use  of  an  element  of  probability*  We  are  apt  to  guess  the  meaning  of 
the  word  by  looking  only  at  its  beginning,  or  the  end  (sometimes  the 
middle)*  In  this  fashion  we  also  achieve  a  reduction  in  the  volume  of 
required  information,  which  at  the  same  time  results  in  the  reduction 
of  the  number  of  required  associating  oells*  Considering  that  nature 
evidently  selected  an  optimum  variant  in  the  creation  of  the  human  brain 
(regarding  the  economy  of  required  elements)  we  would  assume  that  the 
best  system  is  that,  which  reads  the  text  by  syllables  (memorizing  it), 
and  takes  into  account  the  human  faculty  to  make  assumptions  on  the 
basis  of  only  partial  information  (subconscious  mind)* 

Development  of  the  optimum  sizes  of  systems  is  only  beginning. 
There  could  be  no  doubt  that  in  the  years  ahead  optimum  and  invariant 
systems  will  be  designed  (i*e*  instantaneous  cognitive  learning  syntax' 
that  excell  the  speed  and  the  oapaoity  of  the  human  brain* 

Methods  for  Calculating  "Resolving  Capaoity"  of  the  Cognitive  System 

One  of  the  least  developed  problems  is  that  of  the  choice  of  num¬ 
bers  of  associating  cells  in  each  group  -  n^. 

Hie  larger  n^  is,  the  bigger  the  number  of  identifying  symbols 

is,  and  the  greater  the  "resolving  oapaoity"  of  the  system  is*  At  the 
same  time  it  also  lessens  the  possibility  of  error  in  redistribution  of 
images,  ^ee  Note7  To  be  sure,  besides  the  number  of  symbols  it  is 

(/N°te7  To  be  more  eocaot,  with  the  increase  in  the  number  of  symbols,  used 
the  "resolving  capaoity"  either  grows  or  remains  stable*) 
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MOtmy  to  calculate  also  the  usefulness  and  the  stability  of  the  aya- 
beSfeg'  Sat  ua  attempt  to  derive  a  mathematical  evaluation  of  the  "resolv¬ 
ing  oapaoity*  in  order  to  utilise  it  in  the  selection  of  the  nwber  of 
associating  oells  for  the  group  r^.  The  number  of  group  m  equals  the 

number  of  .  individual  iaagee  (for  example*  the  number  of  letters  in  the 
alphabet)*  Tension  at  the  output  of  the  group  fe-ol  equals 

"  "  l-l 

1  </<»i,  Kk<m. 


Indioator  of  maximum  tsnslon  (voltage  indicator)  equalizer  these 
tensions  in  pairs  (after  a  oertain  order)  and  it  is  able  to  register  oho 
maximum  of  tension  only  if  it  is  sufficiently  different  from  the  tensic  ; 
appearing  at  the  output*  Therefore,  we  evidently  shall  be  in  a  position 
to  evaluate  "the  resolving  oapaoity*  by  the  value  of  R*  whioh  depends  on 
the  smallest  differ enoe  between  the  two  highest  tensions  in  output 

X,,  Sit—*  2+  when  taken  in  pairs*  and  from  the  threshold  of  the  worked 

out  indioator  of  maximum  tension*  aooordlng  to  formula  — 

where  is  a  threshold  of  the  worked  out  output  relays*  AS  ,^n  1* 

the  minimum  difference*  selected  from  the  two  largest  among  the  output 
tensions*  When,  for  example*  R  *  3*  it  means  that  the  system  has  a 
necessary  reserve  of  "resolving  oapaoity"  with  the  given  number  of  asso¬ 
ciating  oells  n^  and  the  after  effects  v^  1  ^ 

Numerical  example*  We  shall  designate  "resolving  capacity"  of 
the  system*  >hioh  (in  a  learned  oondition)  has  the  actual  after-effects 
(with  the  number  of  oells  in  the  group  n  *  12)* 

o4«*l-  +  l  +  l-l-l-t-.l  +  l- 1-1-1  + 1-1. 

~  1  - 1  + 1  + 1  + 1  - 1  -  I  -  1  + 1  + 1  + 1  + 1 . 

»Aa*a,*»  — 1  — l  +  l  +  l  +  l  +  l  — 1  — l  +  l  +  l  +  l  +  l 

Using  the  above  formula*  we  find  that  in  projecting  image  A*  ten¬ 
sions  of  the  group  are  equaling 

v,m+!2  unit*  .  2»--6  •  unit*  I*»-2  unit*  Aga»+|4  unit* 

In  projecting  image  B  we  obtain  accordingly 


Zi-~6  unit,  St* +13  unit. 
In  projecting  image  C  - 


2,-0  unit,  AXo+13  unit* 


Ii«-2  unit,  z,*o  unit,  .  Z*-+lt  unit,  A|«+|f  unit* 

The  smallest  of  the  differeneee  in  tensions,  registered  by  the 
voltage  indicator,  is  equal  to  AZ  ^  «  12  unit* 

Assuming  the  threshold  of  the  -worked  out  indicator  to  be 

uav  “  10volt  on®  unit  of  tension  is  equal  to  5  volts  sal  than  the  "re-* 
solving  capacity*  will  be  _  aI  12*5  _ 

’  %L  "  10 

The  system  has  a  sixfold  reserve*  Repeating  a  number  of  snob 
calculations  with  the  ohange  of  the  assooiating  cells  n^  it  is  easy 

to  see  that  the  "resolving  capacity*  of  the  system  decreases  when  n^ . 
becomes  smaller*  The  smaller  n^,  the  smaller  the  "resolving  oapaeity* 

(Fig*  4)  When  8  <  1,  the  system  oeases  to  diaoem  the  images  (Voltage 
indicator  does  not  work)* 


These  ootOLd  be  need  for  rational  selection  of  the  ‘ 

motor  of  in  a  group  In  order  to  eeanr#  the  given  reserve  of 
"resolving  oapadty*  -  H  for  the  system*  In  passing,  let  ns  note  that 
another  additional  Unit  eoold  be  achieved  In  such  a  way  that  with  a 
constant  motor  of  oells  In  the  group  n^  a  selection  ooold  be  aade 

from  the  factor  of  no  symbols  that  would  provide  greatest  "resolving 
oapadty**  The  prompting  in  this  oase  should  ooae  from  the  counting 
devioe  that  would  determine  B  In  aooordanoe  with  the  above  rules* 


Figure  4*  Approximation  of  relationship 
between  the  "resolving  capacity"  of  the 
cognitive  system  and  the  number  of  asaoda- 
ting  oells  in  a  group* 

The  Principles  of  Designing  Systems  far  Control  of  Complex  Prope  r •  w 

As  ie  known  (2)  the  present  system  of  control  in  complex  prc-  ■»*  v 
could  be  divided  into  the  following  parts* 

1)  Means  and  methods  of  peroeiving  situation  (identification) 

2)  Logic- taking  decision 

3)  Means  and  methods  of  influencing  the  object  (aotivizatlon) 

4)  Object  of  control 

These  four  parts  represent  a  system  of  control,  which  ooul&  bs 
developed  by  contemporary  theory  of  control,  including  the  theory  of 
invariants*  Dispersion  analysis  tells  us  that  the  resolving  the  capac¬ 
ity  of  the  system  provides  the  most  effective  influence  on  the  object* 

In  more  simple  oases  any  stabilisation  system  oould  be  designed  complete¬ 
ly  on  the  basis  of  the  above  given  schemes*  Control  and  regulation  of 
thi*  size  helps  in  solving  the  problem  of  identification*  The  logio  of 
it  is  that  the  ability  to  compare  the  regulated  product  with  its  purpose 


helps  in  discovering  the  cense  of  deviation*  Servomotor  and  booster  help 
to  solve  the  problem  of  aetivlsatlon* 

AH  these  factors  together  with  the  object  of  regulation  implement 
a  well  known  system  of  stabilisation  of  the  regulated  quantities*  The 
problem  of  identification  at  the  new  contemporary  level  is  solved  by  a 
battery  of  devloes*  The  above  described  resolving  systems  also  oould 
be  used  for  solving  that  problem*  In  fact,  it  is  not  difficult  to  see 
that  the  problem  of  disoerning  letters  is  not  essentially  different  from 
the  problem  of  discerning  other  situations  that  arise,  for  example,  in 
the  industrial  production* 

let  us  give  a  very  simple  example*  A  tanker  comes  into  port  for 
delivery  of  its  cargo  of  oil*  The  ship*s  presence  at  a  given  berth  in 
the  warf  is  given  a  symbol  +  1  unit,  absence  -  1  unit,  ship  filled  to 
a  certain  level  +  1  unit,  below  that  level  -  1  unit,  etc*  Hie  number 
of  the  output  reactions  would  correspond  to  the  number  of  the  possible  * 

situations j  naturally,  some  of  them  could  be  combined  through  the  use 
of  the  logical  alternative  "OR",  and  would  still  produce  the  desired 
reaction  of  the  system*  We  wish  to  point  out  again  that  control  systems  * 
regarding  complex  processes,  whether  In  their  entirety  or  in  parts,  ft- 
the  purpose  of  managing  an  operation,  could  be  carried  out  as  an  open 
system,  as  a  closed  system,  or  as  a  combination  of  both* 

An  Open  System  of  Management  consists  of  an  identifying  part,  a 
logloal  decision  part,  a  oontrol  pert  and  an  object  of  management  not 
connected  with  the  identifying  part*  A  system  of  that  sort  could  be 
only  a  cognitive  learning  system,  because  the  logloal  decision  in  all 
its  particulars  must  be  given  by  a  man*'  For  each  output  of  the  identi¬ 
fying  part  of  the  system,  a  certain  reaction  should  be  Introduced  into 
the  system  by  a  nan  (In  order  to  influence  the  object)*  Iherefore,  re¬ 
gardless  of  the  fact  that  the  identifying  part  of  the  system  could  have 
a  self-learning  capacity,  the  entire  system  still  could  be  an  open  one 
and  lack  that  magic  quality* 

Closed  System  of  Management  differs  from  the  previous  by  the  fact 
that  its  identifying  part  isoloaely  tied  to  the  object  of  management* 

The  existence  of  a  dosed  circuit  and  a  return  connection  does  not  per  v 

se  indicated  self-sufficient  and  self-organised  system*  Self-organisa¬ 
tion  (or  self-learning)  in  a  closed  system  takes  plate  only  wham 

♦ 

1)  it  selects  and  puts  in  operation  the  algorisms  that  oouple 
the  identifying  pert  to  the  possible  reactions  of  the  system,  and  man*;: 
part  consists  only  in  indicating  the  purpose  of  the  actionf 

2)  it  carries  out  this  selection  and  improves  upon  ltj 

3)  under  certain  conditions  it  also  finds  the  rational  purpose 
of  its  action* 
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Let  us  analyse  these  problem*  of  aslf-organl  sation  om  by  out*  . 

«,  w  kjyc  SSJi 

simplified  examples  there  ere  two  ywttli  bertha  fsr  the  oil  Trying 
tanlc«r»  The  controlling  system  oanri.es  out  the  unloading  of  the  ship* 

The  following  situations  are  possiMLaii  a)  both  berths  are  freer  b)  ths 
first  berth  is  oooupdad*.  o)  the  leeosi  berth  Is  oooupied*  d)  both  berths 
ars  oooupiad*  Ths  simplest  identifying  system  should  have  four  outputs 
corresponding  to  ths  number  of  ths  sjtaatissjji  Oonsidsring  the  loading 
and  unloading  costs*  pooping  expanses*  stsMrehs  number  of  the  possible 
reaotione  oould  be*errldatttla||anah  grsate^xhas  ths  nuober  of  the  out¬ 
puts  of  ths  identifying  s]rstea%  besauss  ths  system  oould  take  into 
oount  previous  history,  and  rsaot  to  ths  oooM  notion  of  outputs*  Hov^- 
ever,  ths  possibls  asariisan  number  of  reactions  oannot  exceed  the  total 
number  of  possibls  outputs,  and  thsy  prevlds  ths  ultimate  figure*  Thu?, 
all  the  variants  of  loading  and  unloading  oould  be  calculated  on  the 
counting  aaohinoa  or  on  the  models  establishing  oequenoes  of  reactions 
(algorisms  of  action),  which  establishes  •toe  eactoemeg  of  any  index  (for 
lnstanoe,  minimum  of  unloading  ooste)  or  shows  the  optimum  relationship 
between  a  number  of  indioes,  (for  accample,  optimum  relationship  between 
the  speed  end  the  ooet  of  unloading)* 

The  purpose  of  the  action  in  such  a  system,  i*e*  demonstration  . 
of  extreme  or  the  optimum  relationship,  is  indioated  by  man  and  ths 
role  of  the  system,  oonslsts  in  riKleulctlng  and  bringing  about  ths  op¬ 
timum  method  of  reaching  this  purpose  by  means  of  oomputing  and  comper¬ 
ing  all  the  possible  variants* 

2,  Systems  with  Self-Improvements  in  Algorisms  for  Finding 
Optimum  Roaotions.  Systems  with  constant  computation  of  all  possit 
variations  on  tEe  oount ing  maohinee  are  accomplishing  only  the  first 
problem  of  self-organisation*  Self-organisation  of  the  system  oould  be 
improved,  if  we  oould  add  the  oapaoity  for  improvement  to  it  on  ths  basis 
of  its  past  perform&nos* 

It  should  be  remembered  that  besides  a  computation  and  comparison 
of  all  the  possible  variants,  we  oould  also  introduce  statistical  re¬ 
search  to  achieve  the  first  satisfactory  result*  In  suoh  a  oase  compu¬ 
tation  and  comparison  oonosrns  only  those  variants  which  are  closest  to 
the  very  extreme* 

s 

Self-learning  systems  with  statistical  research  oompute  the  re¬ 
sults  of  the  previous  performances  end  test  those  decisions  that  appear 
to  be  the  most  promising*  They  rejeet  unsatisfactory  ones  and  do  not 
seek  answers  in  areas  proven  to  be  inappropriate*  Self-learning  in  a 
system  helps  it  make  proper  decisions  faster  and  better*  Self-learning 
systems  seeking  the  optimum  or  the  extreme  are  eharsoterlaed  by  auto¬ 
matically  improving  the  algorisms  of  search  for  their  objectives*  This 


la  achieved  by  adding  another  system,  of  a  diff  erent  order  to  the  exist¬ 
ing  one,  and  treating  the  latter  aa  an  object  of  the  eeoond  (hierarohy 
of  the  extreme  systems)*  Self-learning  results  in  a  speedier  and  better 
work  of  the  system  that  commands  "the  knowledge  of  experience"  •  The 
words  "faster  and  better"  indioate  that  with  reference  to  both  we  ere 
confronted  either  by  the  manifestation  of  statistioal  approach  or  by  the 
supplemental  return  connection  whioh  is  always  present  when  we  deal  with 
the  avalanohe  type  prooeas  of  growth  of  some  physical  quantities*  In 
this  case  action  of  the  supplemental  return  oomwotion  leads  to  a  con¬ 
tinuous  increase  in  speed  of  the  system's  action  and  its  accuracy* 

Thus  the  second  of  the  three  above  mentioned  problems  of  self- 
organization  is  solved  with  the  help  of  statistioal  research  or  supple¬ 
mental  return  connections* 

3*  Systems  with  Research  Regarding  the  Purpose  of  Self-Organiza¬ 
tion*  It  is  possible  to  make  tbe  syshem  more  perfeot  by  adding  to  It 
$Ka~ capacity  for  searohing  and  finding  the  goal  of  self-organization* 

Let  us  pose  a  question  here*  Is  it  possible,  in  principle,  to  remove 
man  from  his  ultimate  role,  the  role  of  establishing  the  systems'  goals 
or  changing  them?  In  principle  this,  evidently,  ie  quite  possible*  But 
then  again  you  have  to  consider  the  process  of  evolutionary  struggle, 
death  and  survival*  Those  systems  that  survived  either  by  ohanoe  or  f. 
any  other  reasons,  suooeeded  in  doing  so  beoauae  they  have  found  ^ore- 
appropriate  aims  and  have  transmitted  them  down  the  line*  This  is 
evidently  the  process  that  took  pl&ee  in  nature,  as  a  result  of  which 
we  have  -the  existing  species  of  animal  and  plant  life*  Beth,  as  is  well 
known,  possess  well  designed  management  systems* 

It  is  evident  that  in  designing  management  systems  for  industrial 
purposes,  knowledge  of  systems'  alms  does  not  oreate  a  problem  because 
the  aims  of  management  are  very  clear  and  could  be  determined  by  men 
without  any  difficulty*  The  rest  of  self-organization  could  be  carried 
out  by  the  system  itself* 
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